Quantitative Data


FairFare: A Tool for Crowdsourcing Rideshare Data to Empower Labor Organizers

Calacci, Dana, Rao, Varun Nagaraj, Dalal, Samantha, Di, Catherine, Pua, Kok-Wei, Schwartz, Andrew, Spitzberg, Danny, Monroy-Hernández, Andrés

arXiv.org Artificial Intelligence

In recent years, labor organizers representing rideshare and delivery workers have advocated for regulations to improve working conditions in the rideshare industry that set wage floors and job loss protections [67]. To call for these improvements, organizers need to understand workers' existing conditions [37], a significant data access and social computing challenge in the rideshare industry. Labor organizers representing rideshare workers typically rely on a collage of qualitative anecdotes and screenshots to provide data about existing working conditions [24]. While these qualitative data provide rich, "thick descriptions" [30] of workers' experience, they are often dismissed by platforms as non-representative, cherry-picked examples. Rideshare platforms, on the other hand, have exclusive access to large-scale, comprehensive quantitative datasets of driver, trip, and pay data that they can draw upon to create authoritative narratives about working conditions in their industry [72]. Labor organizers need comprehensive access to large-scale quantitative data describing working conditions to conduct rigorous, independent investigations and contest platform-driven narratives. There are tools and legal frameworks that empower individual rideshare workers to independently access quantitative work data (e.g., Gridwise and Data Subject Access Requests). However, these tools and frameworks do not provide an intuitive way to aggregate individual worker data into a dataset that provides collective insight into overarching working conditions. Algorithmic auditing scholarship provides methods, like crowdsourcing data, to independently investigate black-boxed systems [66].


QuaLLM-Health: An Adaptation of an LLM-Based Framework for Quantitative Data Extraction from Online Health Discussions

Kouzy, Ramez, Attar-Olyaee, Roxanna, Rooney, Michael K., Hassanzadeh, Comron J., Li, Junyi Jessy, Mohamad, Osama

arXiv.org Artificial Intelligence

Health-related discussions on social media like Reddit offer valuable insights, but extracting quantitative data from unstructured text is challenging. In this work, we present a framework adapted from QuaLLM, QuaLLM-Health, for extracting clinically relevant quantitative data from Reddit discussions about glucagon-like peptide-1 (GLP-1) receptor agonists using large language models (LLMs). We collected 410k posts and comments from five GLP-1-related communities using the Reddit API in July 2024. After filtering for cancer-related discussions, 2,059 unique entries remained. We developed annotation guidelines to manually extract variables such as cancer survivorship, family cancer history, cancer types mentioned, risk perceptions, and discussions with physicians. Two domain experts independently annotated a random sample of 100 entries to create a gold-standard dataset. We then employed iterative prompt engineering with OpenAI's "GPT-4o-mini" on the gold-standard dataset to build an optimized pipeline that allowed us to extract variables from the large dataset. The optimized LLM achieved accuracies above 0.85 for all variables, with macro-averaged precision, recall, and F1 scores above 0.90, indicating balanced performance. Stability testing showed a 95% match rate across runs, confirming consistency. Applying the framework to the full dataset enabled efficient extraction of variables necessary for downstream analysis, costing under $3 and completing in approximately one hour. QuaLLM-Health demonstrates that LLMs can effectively and efficiently extract clinically relevant quantitative data from unstructured social media content. Incorporating human expertise and iterative prompt refinement ensures accuracy and reliability. This methodology can be adapted for large-scale analysis of patient-generated data across various health domains, facilitating valuable insights for healthcare research.
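The validation step described above can be sketched as follows: comparing LLM-extracted binary variables against gold-standard human annotations and reporting accuracy, precision, recall, and F1. The variable name and the labels are illustrative, not taken from the paper.

```python
# Sketch of evaluating an LLM-extraction pipeline against gold annotations.
def evaluate_variable(gold, predicted):
    """Accuracy, precision, recall, and F1 for one binary variable."""
    tp = sum(1 for g, p in zip(gold, predicted) if g and p)
    tn = sum(1 for g, p in zip(gold, predicted) if not g and not p)
    fp = sum(1 for g, p in zip(gold, predicted) if not g and p)
    fn = sum(1 for g, p in zip(gold, predicted) if g and not p)
    accuracy = (tp + tn) / len(gold)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}

# Hypothetical gold labels vs. LLM output for a variable such as
# "cancer survivorship mentioned" (1 = mentioned, 0 = not mentioned)
gold = [1, 0, 1, 1, 0, 0, 1, 0]
pred = [1, 0, 1, 0, 0, 0, 1, 1]
scores = evaluate_variable(gold, pred)
```

In a workflow like the one described, scores such as these would be computed per variable on the 100-entry gold sample, and the prompt iterated until all variables cleared the accuracy and F1 thresholds before running the pipeline on the full dataset.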


Framework for developing quantitative agent based models based on qualitative expert knowledge: an organised crime use-case

Oetker, Frederike, Nespeca, Vittorio, Vis, Thijs, Duijn, Paul, Sloot, Peter, Quax, Rick

arXiv.org Artificial Intelligence

In order to model criminal networks for law enforcement purposes, a limited supply of data needs to be translated into validated agent-based models. What is missing in current criminological modelling is a systematic and transparent framework for modelers and domain experts that establishes a modelling procedure for computational criminal modelling, including the translation of qualitative data into quantitative rules. For this, we propose FREIDA (Framework for Expert-Informed Data-driven Agent-based models). Throughout the paper, the criminal cocaine replacement model (CCRM) is used as an example case to demonstrate the FREIDA methodology. In the CCRM, a criminal cocaine network in the Netherlands is modelled from which the kingpin node is removed; the goal is for the remaining agents to reorganize after the disruption and return the network to a stable state. Qualitative data sources such as case files, literature, and interviews are translated into empirical laws, and combined with quantitative sources such as databases they form the three dimensions (environment, agents, behaviour) of a networked ABM. Four case files are modelled and scored, for training and for validation respectively, in order to transition to the computational model and application phases. In the last phase, iterative sensitivity analysis, uncertainty quantification, and scenario testing eventually lead to a robust model that can help law enforcement plan their intervention strategies. Results indicate the need for flexible parameters as well as additional case file simulations to be performed.
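The disruption scenario above can be sketched minimally: remove the highest-degree ("kingpin") node from a small network and let the remaining agents reconnect until the network is whole again. The toy network and the reconnection rule are illustrative assumptions, not the CCRM itself.

```python
# Minimal kingpin-removal sketch on an adjacency-set graph.
def degree(graph, node):
    return len(graph[node])

def remove_node(graph, node):
    graph.pop(node)
    for nbrs in graph.values():
        nbrs.discard(node)

def components(graph):
    """Connected components via depth-first search."""
    seen, comps = set(), []
    for start in graph:
        if start in seen:
            continue
        stack, comp = [start], set()
        while stack:
            n = stack.pop()
            if n in comp:
                continue
            comp.add(n)
            stack.extend(graph[n] - comp)
        seen |= comp
        comps.append(comp)
    return comps

def reorganize(graph):
    """Toy rule: bridge fragments via their highest-degree members."""
    comps = components(graph)
    while len(comps) > 1:
        a = max(comps[0], key=lambda n: degree(graph, n))
        b = max(comps[1], key=lambda n: degree(graph, n))
        graph[a].add(b)
        graph[b].add(a)
        comps = components(graph)

# Star-like network: node "K" is the kingpin bridging two cells
net = {
    "K": {"a", "b", "c", "d"},
    "a": {"K", "b"}, "b": {"K", "a"},
    "c": {"K", "d"}, "d": {"K", "c"},
}
kingpin = max(net, key=lambda n: degree(net, n))
remove_node(net, kingpin)
reorganize(net)  # remaining agents restore a single connected network
```

In FREIDA, rules of this kind would be derived from the qualitative sources (case files, interviews) rather than chosen ad hoc, and the resulting behaviour scored against held-out case files.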


Natural Language Processing: The Technology That's Biased

#artificialintelligence

Natural Language Processing (NLP) refers to building machines that can understand and respond to voice data with their own text and speech. Natural Language Processing falls under the umbrella of Artificial Intelligence (AI), and recent models like the Bidirectional Encoder Representations from Transformers (BERT), Generative Pre-Trained Transformer 3 (GPT-3), and Pathways Language Model (PaLM) have made accurate human-machine communication possible. These large language models (LLMs) are trained on massive volumes of text with billions of parameters and are able to understand and answer reading comprehension questions as well as generate new text such as a summary. Put simply, LLMs are trained to predict the next words in a sentence, such as by extending the autocomplete feature in messaging applications. But they can do much more, for example question answering, translation, image captioning, human-level dialogue agents, entity linking, or even data cleaning (for mixes of structured and unstructured data). NLP is already being used to automate some human tasks (RPA, robotic process automation); however, with the breathtaking advances of the last three years, NLP opens new potential for businesses to digitize company knowledge and disrupt incumbent business models.
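The "predict the next word" idea above can be illustrated with a toy bigram model that autocompletes from counts in a tiny corpus. Real LLMs use transformer networks over subword tokens and billions of parameters; this sketch only shows the prediction objective, and the corpus is made up.

```python
# Toy next-word predictor: counts which word follows which in a corpus.
from collections import Counter, defaultdict

corpus = "the cat sat on the mat and the cat ate the fish".split()

follows = defaultdict(Counter)
for prev, nxt in zip(corpus, corpus[1:]):
    follows[prev][nxt] += 1

def autocomplete(word):
    """Return the word most frequently observed after `word`."""
    return follows[word].most_common(1)[0][0]

prediction = autocomplete("the")  # "cat" follows "the" most often here
```

An LLM does conceptually the same thing, but conditions on the whole preceding context rather than a single word, which is what makes summarization, translation, and dialogue possible.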


Comparing and extending the use of defeasible argumentation with quantitative data in real-world contexts

Rizzo, Lucas, Longo, Luca

arXiv.org Artificial Intelligence

Dealing with uncertain, contradicting, and ambiguous information is still a central issue in Artificial Intelligence (AI). As a result, many formalisms have been proposed or adapted so as to consider non-monotonicity, with only a limited number of works and researchers performing any sort of comparison among them. A non-monotonic formalism is one that allows the retraction of previous conclusions or claims, from premises, in light of new evidence, offering some desirable flexibility when dealing with uncertainty. This research article focuses on evaluating the inferential capacity of defeasible argumentation, a formalism particularly envisioned for modelling non-monotonic reasoning. In addition to this, fuzzy reasoning and expert systems, extended for handling non-monotonicity of reasoning, are selected and employed as baselines, due to their vast and accepted use within the AI community. Computational trust was selected as the domain of application of such models. Trust is an ill-defined construct; hence, reasoning applied to the inference of trust can be seen as non-monotonic. Inference models were designed to assign trust scalars to editors of the Wikipedia project. In particular, argument-based models demonstrated more robustness than those built upon the baselines, regardless of the knowledge bases or datasets employed. This study contributes to the body of knowledge through the exploitation of defeasible argumentation and its comparison to similar approaches. The practical use of such approaches coupled with a modular design that facilitates similar experiments was exemplified and their respective implementations made publicly available on GitHub [120, 121]. This work adds to previous works, empirically enhancing the generalisability of defeasible argumentation as a compelling approach to reason with quantitative data and uncertain knowledge.
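One core computation in defeasible argumentation can be sketched concretely: given a Dung-style abstract argumentation framework (arguments plus an attack relation), the grounded extension is the least fixed point of the characteristic function, accepting exactly the arguments that are defended against all attackers. The example framework below is illustrative, not one from the paper.

```python
# Grounded extension of an abstract argumentation framework,
# computed by iterating the characteristic function to a fixed point.
def grounded_extension(arguments, attacks):
    """attacks: set of (attacker, target) pairs."""
    def attackers(a):
        return {x for (x, y) in attacks if y == a}

    def defended(a, s):
        # every attacker of `a` must be counter-attacked by some member of `s`
        return all(any((d, b) in attacks for d in s) for b in attackers(a))

    s = set()
    while True:
        nxt = {a for a in arguments if defended(a, s)}
        if nxt == s:
            return s
        s = nxt

# a attacks b, b attacks c: a is unattacked, and a defends c against b
args = {"a", "b", "c"}
atts = {("a", "b"), ("b", "c")}
ext = grounded_extension(args, atts)
```

This captures the retraction behaviour the abstract describes: if a new argument attacking "a" were added, "a" would no longer be unattacked and both "a" and "c" could drop out of the extension.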


Complete Beginner's Guide to Analytics

#artificialintelligence

There's no one magic way to create an experience that will be universally and automatically loved. That's not the goal--rather, we seek to create experiences that will intuitively work for and delight a specific target audience. That's where analytics comes in. If you can't measure it, how will you know if it was successful? This is the question that drives UX practitioners to collect and analyze data.


Qualitative Data Can Provide Context and Meaning to Your Quantitative Data

#artificialintelligence

Someone once said "if you can't measure something, you can't understand it." Another version of this belief says: "If you can't measure it, it doesn't exist." This way of thinking is a fallacy, sometimes called the McNamara fallacy. This mindset can have dire consequences in national affairs as well as in personal medical treatment (such as the application of "progression-free survival" metrics in cancer patients, where the reduction in tumors is lauded as a victory while the corresponding reduction in quality of life is ignored). Similarly, in the world of data science and analytics, we are often drawn into this same way of thinking.


A Brief Introduction to the Concept of Data - KDnuggets

#artificialintelligence

Bio: Angelica Lo Duca (Medium) works as a post-doc at the Institute of Informatics and Telematics of the National Research Council (IIT-CNR) in Pisa, Italy. She is Professor of "Data Journalism" for the Master's degree course in Digital Humanities at the University of Pisa. Her research interests include Data Science, Data Analysis, Text Analysis, Open Data, Web Applications, and Data Journalism, applied to the fields of society, tourism, and cultural heritage. She used to work on Data Security, the Semantic Web, and Linked Data. Angelica is also an enthusiastic tech writer.


How to Improve UX with AI and Machine Learning - Unthinkable

#artificialintelligence

With the rapidly changing face of technology, AI has indeed reshaped the digital world. AI has created a positive impact on diverse sectors like finance, healthcare, retail, and more. But before we delve into how AI and ML are improving UX, let's have a look at what exactly UX means. User Experience (UX) encompasses all the aspects of the end user's interaction with the company, its services, its products, and the overall customer journey. The most crucial requirement for a great UX is meeting the exact customer needs and understanding their behavioral patterns.


Data Types in Statistics Used for Machine Learning.

#artificialintelligence

The field of statistics is the science of learning from data. Statistical knowledge helps you use the proper methods to collect the data, employ the correct analyses, and effectively present the results. Statistics allows you to understand a subject much more deeply. To become a successful Data Scientist you must know the basics. Math and Stats are the building blocks of Machine Learning algorithms.